ABSTRACT

Attention is drawn to the ways in which current
conceptions of intelligence and its measurement differ from those
which were generally accepted in 1928. The following principles
underlying intelligence testing were generally agreed upon in 1928:
(1) the assumption of intelligence as a recognizable attribute,
responsible for differences among children and adults in learning,
reasoning, and other cognitive capacities; (2) the principle that
sampling appropriate mental tasks and norming scores against
distribution in the population yields IQs acceptable as quantitative
measures of level of intelligence; and (3) the view that, being
essentially innate and genetically determined, intelligence develops
with age, regardless of the environment. These statements are
contrasted to 1978 thinking, and the following common criticisms of
intelligence testing are highlighted: most item types are developed
haphazardly without clear rationale; group tests are often given by
untrained lay people under poorly controlled conditions; test results
depend on practice and motivation; the tests are unreliable; and the
public feels threatened by tests. The remainder of the paper is a look
at recent investigations supporting the influence of genetic and
environmental factors on intellectual growth. (6K)

PHILIP E VERNON



That the Fellowship awarded by the Scottish Council for
Research in Education, an honour restricted to Scots or those
who have worked in Scotland, should have been made to one
who last worked in Scotland in the mid-1940s may at first sight
be surprising. To those who have come under his influence,
however, it can be no surprise that Philip Vernon was the
immediate and unhesitating choice of the Scottish Council for
Research in Education as the first one to receive its senior
honour. The paper contained in this booklet stands as evidence of
the rightness of this choice.

Philip Vernon's time in Scotland was spent in Glasgow. He first
went there in 1935 after a most outstanding studentship at St
John's College, Cambridge, and at Harvard, and after a period as
a psychologist at the Maudsley Hospital, London. (His teachers
had included Spearman, Burt, MacDougall, Stout, F C Bartlett,
and Allport.) And it was while he was Principal Lecturer in
Psychology at Jordanhill Training College that he produced his
first book, The Measurement of Abilities (1937). Though the war
took him away from Glasgow to work on personnel selection in
the Armed Forces, he returned there for a short time after the war,
making his mark in the Department of Psychology at Glasgow
University. On leaving Glasgow he again worked with the
Services as a senior psychologist in the Admiralty, but he soon
moved back to academic life when, in 1949, he was
appointed to the Chair of Educational Psychology in the London
University Institute of Education. In 1968 he left London for
Canada, where, for ten years, he was Professor of Educational
Psychology in the University of Calgary. He has recently retired
from his university duties there.

The present paper, though following a line of publications
devoted to the critical summarisation of the work of others,
represents only one side of his contribution to educational psychology.
He has shown originality of mind in work ranging from intelligence
tests (produced while he was still a student at Cambridge)
and studies in psychology of music (while a PhD student) to his
mature work on the causes (social, political, and educational)
of backwardness in many societies.* Not least amongst his
contributions to education has been his influence on students: the
intellectual rigour of the present paper gives some indication of
why that influence has been so great.



* Intelligence and Cultural Environment (1969).

INTELLIGENCE TESTING 1928-1978:
WHAT NEXT?



I particularly appreciate the honour of being invited to give the
first lecture celebrating the Fiftieth Anniversary of the founding of
the Scottish Council for Research in Education, since I lived and
worked in Scotland from 1935-1947 (apart from four of the war
years). And though I was not directly connected with the
Council's work, I knew Dr Rusk and Sir Godfrey Thomson very
well, and greatly admired the steady stream of researches which
the Council encouraged and published. While I was naturally
most interested in the Scottish Mental Survey volumes, it seemed
to me valuable that historical and other types of educational
research should be fostered. And in recent years, during which
more of the research has been conducted by professional staff,
less by individual educationists, I have welcomed the greater
emphasis on problems connected with school organisation and
curriculum rather than on psychometric studies.

My choice of title is not meant to imply that I am going to give
a detailed historical account of developments in mental testing
over 50 years. Rather I wish to draw attention to the ways in
which current conceptions of intelligence and its measurement
differ from those which were generally accepted in 1928. Also the
title refers to the fact that I myself first became interested in this
field in 1928, as a PhD student at Cambridge University, and
produced my own first intelligence test for high-grade secondary
and college students. Thus these 50 years also cover my own
active research and writing on the topic.



In 1928, Spearman, Burt, Drever and Thomson reigned
supreme in British psychology. The Stanford-Binet scale for
children had come into general use; group tests had demonstrated
their value in the American army; and many such tests as the
Moray House series were available for children. Despite the bitter
controversies between Spearman and Thomson, there was fairly
general agreement on the following principles underlying
intelligence testing.*

First, it was assumed that intelligence is a recognisable attribute
which is responsible for differences among children and adults in
their learning, reasoning, and other cognitive capacities. It is an
homogeneous entity or mental power which, like height or weight,
can vary in amount, or in rate of growth or decline, but is essentially
stable in its nature throughout life. Secondly, although obviously
it is not measurable in the same sense as physical attributes like
height, yet the principle of sampling appropriate mental tasks and
standardising or norming scores against the distribution in the
general population yields IQs which can be accepted as
quantitative measures of level of intelligence. Thirdly, intelligence is
essentially innate, being determined by the genes that the child
inherits from his parents; hence it develops or matures with age,
irrespective of the environment in which he is reared. It reaches
its maximum by around 15 years and then stays constant until
senility sets in. Thus the IQ obtained from a reliable intelligence
test in childhood indicates the educational and vocational level
that the person can be expected to attain in his later school career
and in adult life. Burt, writing on general intelligence in 1933,
said: "Fortunately it can be measured with accuracy and ease".

But in 1978 all these statements, though perhaps containing
some grain of truth, would be hotly contested by the great
majority of psychologists. How is it that the testing movement,
long regarded as a major achievement of applied psychology, and
accepted by most laymen as veridical, is now so widely distrusted
and criticised, and is even in some danger of abolition in the
United States, where it once flourished most luxuriantly? Several
States have passed, or at least considered, laws to ban the use of
IQ tests in schools, on the grounds that they are culturally biased
and do not accurately measure intelligence. Many American
parents have successfully challenged in the courts the allocation of
their children to special schools or classes on the basis of low IQs.
It has also been ruled in some suits that employers cannot refuse
to employ blacks or others who obtain low test scores, unless
there is clear evidence that suitability for the job depends on what
the tests measure. The main attack has been on large-scale group
testing, and there is less interference as yet with the use of
individual tests for clinical diagnostic purposes. But the latter too
has been criticised, and some school psychologists have been
forced to substitute the Illinois Test of Psycholinguistic Abilities,
or tests of concept formation or of Piagetian stages, which
measure much the same thing rather less effectively and
conveniently, but which avoid the naughty word "intelligence". I need
hardly mention the similar decline in group intelligence testing in
the UK, with the virtual demise of the 11+.

* This, and some subsequent, paragraphs are largely quoted from a forthcoming
book: Intelligence: Heredity and Environment, to be published by W H
Freeman, San Francisco.

In the early 1920s the average scores of American army recruits
of different national or ethnic descent showed that men of
Anglo-American or Northwest European descent exceeded those of
Southern and Eastern European stock; and American negroes
scored lower still. But the objection was soon raised that these
differences resulted from the economic and educational
advancement of the different groups rather than from innate differences
in ability. In 1928, Freeman, Holzinger and Mitchell's, also
Burks's, studies of foster children were published, suggesting that
the adoption of orphans into good foster homes brought about
significant, even if limited, improvements in their mean IQs. The
work of Hugh Gordon (1923) with canal boat and gipsy children,
also of N D M Hirsch (1928) with children living in isolated
rural regions of Kentucky, strengthened the suspicion that IQs
were more susceptible to environmental advantage or disadvantage
than such pioneers as Terman and Burt believed. Then in 1937,
Newman, Freeman and Holzinger published the first investigation
of identical twins reared apart, which demonstrated that the
correlation among pairs was higher than that among
non-identicals (thus demonstrating genetic influence), yet it was
considerably lower than the correlation among pairs reared together
(presumably on account of their different environments). Figure 1
shows Erlenmeyer-Kimling and Jarvik's collection of correlations
between related pairs, and the bottom four rows clearly illustrate
the effects of genetic and environmental factors. At about the
same time, R L Thorndike (1933) pointed out that IQs are much
less stable over time than was generally believed. Although
individual tests given a week apart correlated at about .95, the
reliability coefficients over 5 years dropped to around .70. Thus
"intelligence" was by no means fixed for life; and the later
longitudinal studies by Bayley (1949) and others showed that
developmental or intelligence quotients obtained in the first 3 years or so
of life bore scarcely any relation to later childhood or adult IQ
(see Table 1).

Table 1

Correlations between IQs from Various Preschool or Intelligence Tests at
Different Ages (from N Bayley).

Age at         No. of Years until Retest
first test     (intervals as printed: [illegible], 3, 6, 13;
               legible values listed in printed order)

3 mths.        [illegible]  [illegible]  0.02
1 yr.          0.47  0.23  0.13  0.00
[2 yrs.]       0.74  0.55  0.50  0.43
3 yrs.         0.55  0.33
4 yrs.         0.71  0.73  0.70
6 yrs.         0.86  0.84  [illegible]  0.77
7 yrs.         0.88  0.87  0.73  0.80
9 yrs.         0.87  [illegible]
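To make the contrast between a one-week correlation of about .95 and a five-year correlation of about .70 concrete, the following sketch applies the ordinary linear regression formula for predicting a retest score from an earlier one. The IQ scale with mean 100 and standard deviation 15, the example score of 130, and the function names are assumptions introduced here for illustration; they do not come from the lecture.

```python
import math

IQ_MEAN = 100.0  # assumed scale mean (not stated in the lecture)
IQ_SD = 15.0     # assumed scale standard deviation (not stated in the lecture)


def predicted_retest_iq(first_iq: float, r: float) -> float:
    """Best linear prediction of a retest IQ, given the test-retest correlation r."""
    return IQ_MEAN + r * (first_iq - IQ_MEAN)


def standard_error_of_estimate(r: float) -> float:
    """Typical spread of actual retest IQs around the predicted value."""
    return IQ_SD * math.sqrt(1.0 - r ** 2)


if __name__ == "__main__":
    for label, r in [("one week", 0.95), ("five years", 0.70)]:
        pred = predicted_retest_iq(130.0, r)
        see = standard_error_of_estimate(r)
        print(f"{label}: r = {r:.2f}, predicted IQ = {pred:.1f}, "
              f"standard error = {see:.1f}, shared variance = {r ** 2:.2f}")
```

On these assumptions a child scoring 130 would be expected to score about 121 five years later, give or take roughly 11 points, which is the sense in which a correlation of .70 leaves an individual IQ far from fixed.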





In 1949, Hebb's influential book, The Organisation of
Behaviour, seemed to provide a reconciliation between the opposed
views of hereditarians and environmentalists, by distinguishing
what he called Intelligence A and Intelligence B. Intelligence A
was the genetically determined plasticity of the central nervous
system which was necessary for any mental growth; but this could
not be observed directly nor measured. What we do observe and
measure fairly effectively is the current level of all-round mental
efficiency, Intelligence B, and this depends on the interaction
between the genetic potential and the stimulating or inhibiting
effects of the environment in which the individual has been
reared. Hebb's experiments on dogs and rats reared in different
environments supported this interpretation. At about this time
too, Piaget's work on child development was becoming widely
known in Britain and America, and his book on the Psychology of
Intelligence (1950) likewise stressed the role of physical and social
environment, as well as neurological maturation, in the
development of operational thinking. This interactionist theory is now
accepted by almost all psychologists except for a few rabid
environmentalists; but unfortunately, of course, it still leaves
scope for a wide range of opinions on the relative importance of
hereditary and environmental determination. I will describe later
some of the more recent, and convincing, studies of this problem.

The other main line of attack on intelligence testing came from
the factor analysts who, following Thurstone and Guilford,
claimed that intelligence is not a unitary general ability, but a
collection of numerous independent kinds of ability or primary
factors, Verbal, Spatial, Number, etc. Now the multifactorial
model works quite well when batteries of varied tests are given to
selected, homogeneous populations such as college students. But
as Thurstone found when he extended his investigations to
younger age groups, one or more second order factors appeared
which, he admitted, corresponded to Spearman's g. In an
heterogeneous population such as Army recruits, or an age group of
children, the general factor accounts for at least 50 per cent of
the variance, and additional more specialised abilities for not
more than about 25 per cent. Burt has shown that the g + group
factor model is mathematically equivalent to the Thurstone
multiple factor solution. They are just alternative, and mutually
convertible, ways of classifying abilities. Thurstone's view
appeared to be superior in practical utility because it provided a
profile of scores on half-a-dozen or more separate ability factors,
rather than just the single global IQ plus some small additional
group factors. But the trouble is that the differential or
pure-factor scores derived from Thurstone's model just did not
differentiate. Far too many testees score high on all of them, or
low on all. Thus we find in practice that, despite all the efforts
put into factorial investigation techniques over the past 70
years, almost all applied psychologists working in school, clinic,
or occupational fields make use chiefly of a single general test, or,
quite often, of a two-pronged test like Wechsler's verbal and
performance, or the verbal and nonverbal Lorge-Thorndike or Alice
Heim tests, or the verbal and quantitative College Entrance Board
tests. Numerous critics of intelligence testing allege that these
general tests are worthless because different factorists put forward
contradictory theories of ability structure; and there is no
agreement on the nature of intelligence or 'g' (cf. Block and Dworkin,
1974). But in fact the general factor is so prominent that
quantitative studies of heritability do give pretty consistent results;
whereas attempts to study the genetic and environmental
components of Thurstone's primary factor measures have yielded
hopelessly contradictory results.
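
The claim that a general factor can account for half or more of the variance in a heterogeneous group can be illustrated with a deliberately simplified case. The sketch below assumes a hypothetical battery of six tests that all intercorrelate .40 (figures invented for illustration, not taken from any study cited here) and uses the first principal component as a stand-in for the general factor, which is an approximation rather than Spearman's or Burt's own procedure.

```python
def equicorrelation_matrix(n_tests: int, r: float) -> list[list[float]]:
    """Correlation matrix in which every pair of tests correlates r (hypothetical)."""
    return [[1.0 if i == j else r for j in range(n_tests)] for i in range(n_tests)]


def largest_eigenvalue(matrix: list[list[float]], iterations: int = 200) -> float:
    """Largest eigenvalue by power iteration (adequate for small positive matrices)."""
    n = len(matrix)
    vec = [1.0 + 0.1 * i for i in range(n)]  # arbitrary positive start vector
    for _ in range(iterations):
        new = [sum(matrix[i][j] * vec[j] for j in range(n)) for i in range(n)]
        norm = sum(x * x for x in new) ** 0.5
        vec = [x / norm for x in new]
    # Rayleigh quotient of the converged unit vector
    product = [sum(matrix[i][j] * vec[j] for j in range(n)) for i in range(n)]
    return sum(v * p for v, p in zip(vec, product))


def general_component_share(n_tests: int, r: float) -> float:
    """Proportion of total variance carried by the first principal component."""
    corr = equicorrelation_matrix(n_tests, r)
    return largest_eigenvalue(corr) / n_tests


if __name__ == "__main__":
    share = general_component_share(6, 0.40)
    print(f"first component share: {share:.2f}")
```

With six tests intercorrelating .40 the first component carries 3.0 of the 6 units of variance, or 50 per cent; higher intercorrelations, such as are found in more heterogeneous groups, raise that share.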

Turning now to the common criticisms raised by people who
oppose testing, whether for ideological or other reasons: how
far can these be answered? It is often easy to pick out particular
test items for ridicule, and to say that these don't measure "real"
intelligence. Actually, of course, all items will usually have been
validated against total score, though it is true that some get out-
[...]
alike?" Though we can't reach any precise operational definition
of intelligence, the evidence from factor analysis and other sources
does show that items are measuring a consistent and important
component of ability. But I would have to agree that most of the
item-types have developed haphazardly without sufficiently clear
rationale. Perhaps we could arrive at a better sampling of mental
efficiency through advances in our knowledge of information
processing, as Resnick and her co-authors suggest in The Nature
of Intelligence (1976).

Second: group tests are often given by untrained laymen, under
poorly controlled conditions; they may be mis-scored and
misinterpreted. With this I would tend to agree. Although certainly I
have made much use of group tests, their results have always
seemed to me too chancy to be of great value in individual
diagnosis. And this is borne out by the considerably greater stability
of individual test IQs over time. For example, the correlation of
Binet 11 yr. IQs with [...] yr. IQs is about .85 to .90; whereas
verbal group tests correlate .75, and nonverbal group tests about
.60 (see Table 2). Individual testers are, of course, better trained
and can standardise administration more adequately, though they
too have been shown to have their idiosyncrasies, such as varying
standards of evaluating children's responses. And it is highly
probable that they are susceptible to halo or expectancy effects;
that is, they judge from what the school tells them, or from
impressions obtained in initial conversations, that the child is
bright or dull, and this influences their administration and
scoring. On the other hand there is no justification whatever for
the teacher expectancy effects alleged by Rosenthal and Jacobson
(1968). Their experiment, purporting to show that children who
are reported to teachers as being bright obtain significant IQ
gains over the next few months, was full of technical faults. And
numerous attempts at replication have led to completely negative
results (see Elashoff and Snow, 1971).

Table 2

Correlations between Verbal and Non-verbal IQ at different Grade levels and
Terminal IQs (Hopkins and Bracht).

Grade         Verbal        Non-Verbal    Combined      Binet

1             [illegible]   [illegible]   [illegible]   [illegible]
2             .5[?]         .51           .66           (.72)
4             .75           .56           .72           (.78)
7             .74           .65           .77           (.82)
[illegible]   .78           .67           .82           (.80)



Third: test results depend on practice or coaching. Agreed that 
significant rises have been demonstrated; but they are limited, 
and they can be largely overcome by giving all children concerned 
adequate preliminary practice. The individual tester can usually 
guess when the same test has been used recently. Where this 
weakness becomes really serious is in cross-cultural group testing 
of children or adults who have no previous experience of tests. 

Fourth: test results depend on motivation. Obviously, it is said,
the child who is confident, cooperative and interested will do
better than one who is anxious, bored, distractible, or has a
negative self-concept. Actually it has been difficult to get much
confirmatory evidence, except in child guidance cases, where the
tester can gauge the child's cooperation, and can do a good deal
to build up suitable rapport. She will also usually note if the IQ is
likely to be unreliable through lack of motivation. But the group
tester, I must admit, can do very little to stimulate the motivation
of all the children in a class. Some experiments (eg Benton, 1936)
have indicated that extra motivation has little effect; for instance
offering monetary rewards for good performance may stimulate
children to try more items, but not to get more of them right. We
also found in the army that the scores of women recruits were not
affected by minor illnesses such as colds, feeling off colour, or
menstrual periods. I would agree that this point too is of much
greater importance in cross-cultural testing, where the testees may
be suspicious or anxious about a foreign tester.

Fifth: I suppose the most common and most misguided
criticism of intelligence tests is that they merely measure acquired
information and skills. Items such as those shown from Binet or
WISC in Table 3 are often quoted, with the implication that
obviously slum children have had much less opportunity than
middle-class children to hear about the first two items; while the
third represents knowledge of moral conventions in middle-class
society, and the typical response from lower-class children might
be very different. But it is false to say that vocabulary and verbal
skills are acquired, in the sense that anybody could acquire them,
if taught. They are developed in the same way as other aspects of
Intelligence B. Children don't usually know or use difficult words
unless they have reached the level of mental maturity sufficient to
understand the concepts to which the words refer. Actually E L
Thorndike published a careful investigation of just this point in
1927. He gave 3 tests involving vocabulary and informational
skills, and 3 tests involving verbal and mathematical reasoning
skills, to Grade 8 boys, and found that the correlations between
the two types of test were just as high as those within either type.
In other words the information tests were measuring reasoning
capacity as effectively as did tests designed to sample reasoning.
However I would agree that most test constructors nowadays do
tend to avoid items which seem to involve cultural bias, and to
rely more on items whose difficulty depends on complexity of
information processing.

Table 3

Some Specimen Individual Test Items

1. Who wrote Romeo and Juliet?
2. What is a hieroglyphic?
3. [...]

Because of the importance of this topic, I will quote a more
recent study by Arthur Jensen (1974) of test bias. The Peabody
Picture Vocabulary, which appears likely to be culturally loaded,
and the Raven Matrices, which appears relatively culture-fair,
were given to 600 white and 500 black pupils in Californian
schools. On both tests the whites, as usual, obtained higher mean
scores, the black performance being equivalent to that of whites a
year or two younger. But there were no other features of the
responses to indicate any differences between the groups. The
reliabilities and the factorial content of tests and items were
the same in blacks and whites, and the rank orders of item
difficulty were almost identical. That is, there were no items that
were relatively more or less difficult for one group than the other.
Also the blacks were not handicapped more on vocabulary than
on Matrices. Many other studies have been carried out by various
authors with college students, showing that Scholastic Aptitude
and other types of intelligence tests predict college grades among
blacks in just the same way as they do among whites (see Hunter
and Schmidt, 1976). The tests are more difficult for low economic
class or minority group students, and this difference is doubtless
in part due to deprived conditions of upbringing. But it is just not
true that the tests are unfairly loaded against particular
sub-groups reared within western societies.

My sixth point arises from the admittedly imperfect reliability
of intelligence tests. Critics often report atrocity stories about
children whose educational and vocational careers were
permanently blighted because they obtained a low IQ soon after
entry to primary schooling. I would not say that this cannot occur
(particularly if group tests are used), but no one has compared
the incidence of faulty prognoses as against the incidence of valid
ones, where the information obtained by an intelligence test has
led to much more appropriate educational provisions than merely
relying on parents' or teachers' judgments. In our Calgary school
system, it is usual for special class or special school children to be
retested individually every 2 years, in case they do show sufficient
improvement to justify return to ordinary schooling.

Last of these common criticisms: Cronbach (1975) points out
that the very success of intelligence and educational testing has
contributed to their downfall. The public feels threatened because
more and more of their own, or their children's, educational and
occupational careers are decided by tests, whose content, and the
scores they yield, are kept secret. Many people would prefer the
old-fashioned approach of school examinations, interviews, and
decisions based on academic and occupational record, with which
they are familiar, and it is difficult to convince them that these
are even less accurate. Then too there is the revolt against
invasions of privacy at a time when so much personal information
is being computerised; thus many parents demand access to the
psychologist's files. The psychologist, however, regards this
information as confidential, as the doctor does; and he refuses to give
out IQs or other scores, because they would so readily be
misunderstood. Further, how could he carry out clinical
examinations or treatment without keeping records of family
circumstances that the parents might resent? I would like to add too
that psychologists such as Anastasi (1968), Cronbach (1970),
Sattler (1974) and others are very well aware of the defects and
dangers of testing to which I have drawn attention; and they do
their best by their textbooks and university courses to train
teachers and future psychologists to improve their usage of tests.

I will now go on to some of the major investigations of recent
years which supply strong evidence of the influence both of
genetic and environmental factors on children's intellectual
growth. On the genetic side, the greatest weight has been put on
kinship data, especially on identical twins reared apart. In four
separate studies,* which yielded only 122 pairs in all, the mean
inter-twin correlation was .82, and this figure can be taken as an
estimate of the heritability of IQ, or percentage of genetic
variance, the remaining 18 per cent being environmental.
However the largest single group was the 53 pairs collected by Sir
Cyril Burt; and his correlation of .88 suggested that separated
identicals are almost as much alike in intelligence as identicals
brought up in the same home. In 1972, L J Kamin, of Princeton
University, drew attention to certain discrepancies in Burt's
published figures, and in 1974 Jensen issued a complete list of all
such apparent inconsistencies. Several of these were probably
miscopyings, but others were more serious, such as reporting
identical correlation coefficients for different sized groups of
twins, suggesting that Burt had not bothered to recalculate when
he gathered additional cases. These lapses have been blown up by
Kamin (1974), and by Gillie (1976) in the UK, into an accusation
that Burt's data were faked, and that none of his findings are of
any scientific value, since we cannot know whether they contain
further distortions. I cannot myself regard Burt's work as
fraudulent; almost all the discovered inconsistencies are so stupid
that he would surely have made his results much more plausible if
he had been intentionally faking. I would agree that he was more
careless about such details than research psychologists are
nowadays; also that he was so strongly wedded to genetic explanations
that the planning of his investigations, and his methods and
analyses, may have been biased at times. For example, the
techniques he used for assessing the intelligence of adult relatives, and
the procedures of getting what he called children's adjusted IQs,
seem to have been highly subjective. I'm afraid then that we will
have to jettison his individual test data; but this does not mean,
as the critics claim, that the whole foundations of heritability of
intelligence are washed away. Other, more scrupulous
investigators have obtained figures which do not differ significantly from
Burt's; and the fact that his correlation for separated identicals is
higher than anybody else's could very well be due to his pairs
being children, whereas other workers like Newman and Shields
used adults with a wide age range, whose test scores, therefore,
would certainly be less reliable.

* Namely Newman, Freeman and Holzinger (1937), Shields (1962), Juel-Nielsen
(1960) and Burt (1966).
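
The arithmetic behind the twin figures quoted above (a mean correlation of .82 from 122 pairs, of which Burt supplied 53 with a correlation of .88) is simple enough to set out. The sketch below takes the correlation for identical twins reared apart directly as the genetic share of variance, as the lecture does, and then asks what the other three studies would average if Burt's pairs were set aside. That second step assumes the reported mean of .82 is weighted by number of pairs, which the lecture does not state, so the result is indicative only; the function names are introduced here for illustration.

```python
def variance_shares_from_mza(r_apart: float) -> tuple[float, float]:
    """Treat the reared-apart identical-twin correlation as the genetic share.

    Returns (genetic share, environmental share) of total IQ variance.
    """
    return r_apart, 1.0 - r_apart


def mean_without_study(mean_all: float, n_all: int,
                       r_study: float, n_study: int) -> float:
    """Mean correlation of the remaining pairs once one study is removed.

    ASSUMPTION: mean_all is an average weighted by number of pairs.
    """
    remaining = n_all - n_study
    return (mean_all * n_all - r_study * n_study) / remaining


if __name__ == "__main__":
    genetic, environmental = variance_shares_from_mza(0.82)
    print(f"genetic {genetic:.0%}, environmental {environmental:.0%}")

    without_burt = mean_without_study(0.82, 122, 0.88, 53)
    print(f"mean correlation without Burt's pairs: {without_burt:.2f}")
```

Under that weighting assumption the remaining 69 pairs average about .77, still a substantial figure, which is consistent with the argument that discarding Burt's data does not remove the case for heritability.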

Kamin's book, The Science and Politics of IQ (1974), does not
merely attack Burt. He tries to pick holes in all other published
studies that suggest genetic effects; and several reputable
reviewers have rejected his methods and conclusions. If Burt was
biased in one direction, Kamin is much more so in the opposite
direction. However he does make a very valid point about
separated twin studies, namely that such twins are never
randomly assigned to different parents. Although Burt himself
denied it, I am sure that the two members of any pair would
usually be reared in homes quite similar in socioeconomic and
educational status. Hence part of the high correlation within such
pairs could be due to environmental, and not only genetic,
similarity. More generally, what is called Genetic-Environmental
covariance has been neglected by most authors of heritability
analyses. This term refers to the obvious likelihood that intelligent
parents who pass on superior genes to their offspring also usually
provide above-average environments. However 3 recent studies of
kinship data by Morton (1972), Loehlin et al (197[?]) and Jensen
(1977) have separated off this component, and you can see from
Table 4 that there is fairly close convergence on a genetic
percentage of around 65, environmental 23, and covariance 12.
Note too that the genetic percent is well below the 80 per cent
which Jensen advocated in 1969 on the basis of Burt's group test
figures. I think it quite likely that if we could test larger samples
under more carefully controlled conditions, the genetic percentage
might drop below 60. But the precise figure doesn't matter, so
long as it is recognised that both the genes and the environment
have very substantial effects on child IQ, and that the genetic
component is probably the larger one, which we cannot afford to ignore.
In any case Jensen and his critics point out that this figure, be it 80
per cent or 65 per cent or 50 per cent, is not an absolute figure; it is
relative to the particular population such as British or American
whites, and would alter if the range of environmental differences
altered. I have also neglected the complications of Dominance
effect and of Assortative Mating, since these do not affect the
main argument.

Table 4

Heritability Analyses Including Estimates for Genetic-Environmental Covariance

               Jencks et al.   Loehlin   Morton        Jensen

VG             .45             .61       [illegible]   [illegible]

VE             .35             .23       .19           .28

G x E Cov.     .20             .15       .14           .07
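
As a rough illustration of how the three shares quoted in the text combine, the following sketch treats the rounded percentages (65, 23 and 12) as a simple partition of the total IQ variance. The dictionary and function names are hypothetical, and the figures are the approximate ones given above, not re-estimates from the original studies.

```python
# Rounded shares of IQ variance quoted in the text (per cent).
# All names here are illustrative and not taken from any cited study.
shares = {"genetic": 65, "environmental": 23, "covariance": 12}


def as_proportions(percentages):
    """Convert percentage shares into proportions of the total variance."""
    total = sum(percentages.values())
    return {name: value / total for name, value in percentages.items()}


proportions = as_proportions(shares)
total_percent = sum(shares.values())     # the three shares should exhaust the variance
largest = max(shares, key=shares.get)    # the component with the biggest share
```

On these rounded figures the three components account for the whole of the variance, with the genetic component the largest, which is all that the argument requires.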

In view of the many difficulties associated with twin or other
kinship data, the chief alternative approach is through foster
children, since here we can study environmental effects without
any genetic connection between child and foster parents.
Unfortunately, there are still a lot of complications, as Munsinger
(1975) pointed out in a recent review; and the results of different
investigations vary greatly. However, there is a fair consensus that
adoption at an early age into a good home tends to raise the IQs
of adoptees, though probably not more than about 10 IQ points
on average. The correlations between child IQ and measures of
foster-parent ability or education, or the home rating, are mostly
quite small. From 6 published studies (see Table 5) I find a
median figure of 0.23, and this is almost certainly boosted by the
tendency for selective placement, that is, the attempt by the
adoption agency to match the child with the foster home level. In
some studies it was possible to get estimates of the ability of the
true, or biological, parents, though the data are seldom complete.
In another 6 studies (see Table 6), I found a median figure of
0.30 (or probably higher). In spite of Kamin's obsessional efforts
to demolish the published investigations, we must, I think,
conclude that the genetic influence of true parents who did not
rear the children is greater than the environmental effects exerted
by the foster parents. So this line of evidence confirms rather
neatly the conclusion from kinship analyses.

Table 5

Study               N             Correlation

Freeman et al.      401           .39 to .52
Burks               214           .23 to .42
Leahy               [illegible]   .18 to .24
Skodak & Skeels     139           .04 to .20
Horn & Loehlin      146           .09 to .15
Munsinger           41            -.14

Approximate median < .23

Table 6

Correlations of Foster Child IQs with Ability of Biological Parent(s)

Study               N      Age of testees   Correlation

Munsinger           41     [illegible]      .70
Skodak & Skeels     63     13½              .38 to .44
Horn & Loehlin      146                     .32
Lawrence            185                     .26

(With father SES)

Skodak & Skeels     139    [illegible]      .23
                    70                      .12

Approximate median > .30



There are a number of other types of evidence which contribute
to the case for genetic determination of intelligence: the fact that
rats and dogs can be bred to produce bright and dull strains; the
tendency for close human inbreeding to yield congenital
malformations and mental defect; and the discovery that specific gene
anomalies produce psychological syndromes such as Down's and
Turner's. A point which strikes me as highly convincing is that
children often do not resemble their parents or their siblings in
intelligence. Resemblances between them (amounting to an
average correlation of 0.5) could plausibly be attributed to the
effects of common upbringing. But the fact that professional
parents can have quite dull children, and lower-class, poorly
educated parents have very bright children, would be expected on
genetic though not on environmental theory.

Let us now look at some of the more striking studies of
environmental effects. Some remarkable investigations in recent years by
Trevarthen (1974) at Edinburgh, also Schaffer (1977), Bower
(1974), and Jerome Bruner (1975) have brought out the
importance of mother-infant interactions in promoting cognitive
development. Also longitudinal follow-up investigations like the
Berkeley Growth Study, and the work at the Fels Institute, have
yielded substantial correlations between certain types of parental
handling of children and their later intelligence. However, such
results are difficult to interpret, since they might also be
explicable genetically. I have mentioned already the substantial
genetic-environmental covariance effect. In addition, children with
superior genes tend to exploit and even shape their own
environments more effectively than dull ones. They are more interested in
reading, ask more questions, and explore more actively. In other
words, the genes may affect the environment rather than the
environment affecting the intelligence. Hence the evidence from
intervention studies is more convincing, where children are
submitted to specially stimulating or deprived environments, and
can be compared for intelligence with control groups.

Several cases of severe deprivation have been described in the
literature, that is, children who grew up in a highly restricted
environment and had hardly any contacts with other human
beings. As would be expected from interactionist theory, their
mental development scarcely went beyond the imbecile level.
Nevertheless Koluchova's (1972) pair of twin boys, and other
cases, who were rescued and put into good foster homes as late as
7 years of age, overcame their retardation. Their IQs rose from 40
or below to 100 or above. Thus a highly unfavourable environment
in early childhood does not necessarily bring about irremediable
damage.

In the late 1930s Skeels tested 24 orphans in a very
unstimulating institution, at ages 7 to 30 months. Thirteen of them were
then transferred to a hospital where they received a lot of care
and individual attention from mentally defective girls; and most
of them were later fostered out into lower middle-class homes.
Skeels claimed a large improvement averaging 27½ IQ points in
these transferred cases, whereas in the other 11, left in the
original institution, there was a further drop of 26 points. Some
25 years later Skeels (1966) traced all these cases, and found the
transferred ones to be normal, self-supporting adults, holding
quite a range of skilled jobs, or else they were married women.
The non-transferred were still either institutionalised, or in very
low-grade jobs. The average numbers of years of schooling of the
two groups were 11.7 and 4.0 years respectively. Though I would
not put any credence in intelligence tests given at such an early
age, it seems reasonable to conclude that, as adults, the
transferred cases averaged at least 30 IQ points higher than the others.
Probably the best controlled study is that of Heber and Garber
in Milwaukee (cf Garber and Heber, 1977), though insufficient
details have so far been published for us to evaluate the findings.
Forty negro boys were selected at birth whose mothers scored 80
IQ or below, and who lived in a very poor neighbourhood. Twenty
of them, the Experimental group, attended a Centre for 7
hours a day, 5 days a week, and underwent an all-out programme
devised to improve their sensori-motor, language and thinking
skills. Simultaneously their mothers were given an educational
programme including home-making, child care, and vocational
training. The other 20 were brought up at home, but took the
same periodic tests as the Experimentals. Up to about the age of
14 months the two groups remained closely parallel on the Gesell
scale; but the Controls began to fall behind after 18 months (see
Figure 2). On preschool scales given between 2 and 4½ years,
Heber found mean IQs of 122.6 and 95.2, that is, a superiority of
27.4 points among the Experimentals. Up to the age of 6 the
Experimental means stayed between 110 and 120, whereas the
Controls dropped to around 85. The special programme ceased
when the children entered first grade. By ages 8 to 9, the
Experimentals dropped to an average of 104, while the Controls now
averaged 80. Final figures are not yet available and, in the
absence of further stimulation, it is possible that the
Experimentals may show some further decline. But clearly they have a
tremendous advantage over the Controls reared in their own
homes.

Fig. 2. IQs of Heber and Garber's Experimental and Control Groups.
(Note: The last two points at 7 and [illegible] years are approximate only, being based on a lecture, not a published article.)

These results are all the more striking when compared with the
virtual failure of the well-known Head Start experiment in the
1960s to produce any permanent gains. However, the Head Start
programmes usually amounted only to a few months' attendance
at a nursery school for a few hours a day, before entering
elementary school. Since then, numerous intervention studies have been
based more on helping the mothers to stimulate and interact
better with their 1 to 4 year old children, by means of home visits
from psychologists or specially trained teachers. Levenstein's
(1970), Karnes and Teska's (1970), Bronfenbrenner's (1974), and
other programmes have produced gains of some 13 to 20 IQ
points, though there is not much evidence yet of the permanence
of the effects. Another advantage of such schemes is that they are
relatively inexpensive, probably costing less than Head Start;
whereas the Milwaukee experiment was far too expensive ever to
be applied to large numbers of children.

We might expect even more effective intervention if black, or
American Indian, children, for example, were adopted shortly
after birth by white parents, and reared in a superior white
environment. Several reports have appeared suggesting that such
children grow up to be as intelligent as white children in their
own families; but usually there is no control over selective
placement, and the true parents could well have been of above average
intelligence (see Loehlin, Lindzey and Spuhler, 1975). The most
thorough study is that of Scarr and Weinberg (1976) of 99 negro
or part-negro children whose true parents were of about average
education for the area. Thus the children, if reared in their own
homes, would have obtained a mean IQ of about 90. Twenty-nine
children were known to have 2 black parents, and when tested
after the age of 4 in the foster homes, their mean was 97.
Sixty-eight others with one black, one white, parent averaged 109. But
white children of the adoptive parents averaged 118. Apparently
then the improved environment does produce a substantial gain,
though the white-black mean difference is certainly not wiped out.
Thus more data of the same kind are urgently needed. I should
add that I am not taking any stand myself on the issue of racial
differences in intelligence, since it is much more difficult than it
is with individual differences to assess separately the genetic and
environmental components. I see no reason why there should not
be some innate psychological characteristics, just as there are
innate physical differences, but I would expect them to be small
in comparison with the very large cultural and environmental
differences. Also, minority group children are much more
susceptible to pre- and peri-natal handicap, and to malnutrition,
than white children; and this might explain part of the difference.
Unfortunately no one has yet been able to define the precise
environmental factors which particularly handicap minority group
children; hence environmentalist theories are almost entirely
speculative. As Urbach (1974) remarked: "Everything in the
world can be explained by factors that we know nothing about."

Although the total evidence of environmental effects on
individual intellectual growth is very strong, the amount of
improvement in IQ brought about by fostering or intervention
does not exceed what would be expected from the figure of 20 to
30 per cent environmental variance that I suggested earlier. True,
there are occasional cases like Koluchova's twins who register far
greater changes, but they are so abnormal that they can hardly be
said to invalidate the heritability analyses that I referred to earlier.
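
The arithmetic behind this claim can be sketched roughly as follows. This is an illustration resting on assumptions that are not stated in the text: an IQ standard deviation of 15 points, and environmental effects that add independently to the score, so that the environmental share of variance converts directly into a standard deviation in IQ points. The function name is hypothetical.

```python
import math

IQ_SD = 15.0  # assumed population standard deviation of IQ (not given in the text)


def environmental_sd(env_share, iq_sd=IQ_SD):
    """IQ standard deviation attributable to environment, given its share of variance."""
    return math.sqrt(env_share) * iq_sd


for share in (0.20, 0.30):
    sd = environmental_sd(share)
    # IQ change expected from moving 2 or 4 environmental SDs up the scale
    print(f"share={share:.2f}  env SD={sd:.1f}  2 SD shift={2 * sd:.1f}  4 SD shift={4 * sd:.1f}")
```

On these assumptions an environmental standard deviation is worth roughly 7 to 8 IQ points, so a move of two to four such units, say from a very poor to a very good home, would correspond to gains of the same order as those reported in the fostering and intervention studies above.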

Lastly, let us ask what of the future? I will be neither
surprised, nor sorry, if group tests of children's intelligence
disappear, particularly within elementary or primary schools. There
is likely to be much less criticism by educationists and parents of
instruments called Verbal or Nonverbal Reasoning tests; that is
the name which Moray House adopted many years ago. Likewise
the Scholastic Aptitude tests of the American College Entrance
Board, or so-called factor tests such as verbal, reasoning, spatial,
etc., are more acceptable, and measure very much the same thing.
They can be useful in rough sorting of secondary school students
by ability, or for admission to American universities, and for
some occupational selection as in the Armed Services. But we
must discourage the notion that they measure a single, global,
innate intelligence, or that they predict by themselves the
educational or occupational potential of the testees. They should be
regarded as only one bit of evidence whose validity for particular
purposes has been demonstrated. That is, they should be
combined with evidence from other sources such as scholastic or
work record, interview judgments, or special aptitude tests such
as mechanical, clerical, etc. Also their low reliability for long-term
predictions should be borne in mind. Where the test content or
format appear unfair to minority groups such as American
blacks, or coloured immigrants to Britain, their validity and
regression coefficients should be specially investigated. Group
tests will also continue to be useful as control variables in
educational or psychological researches, i.e. for assessing the
representativeness of a population sample, or for matching contrasted
samples. Since we are concerned with group characteristics in this
context, the limited efficiency of the tests for individual diagnosis
is less of a drawback.

Next, I don't foresee any serious problem in using individual
tests like WAIS with adult clinical patients; though it would be
useful to have other more diagnostically valid tests like the Reitan
battery to supplement it. But I am more dubious about
Stanford-Binet and WISC or WPPSI for children, since these are so closely
associated with the concept of innate potential. Besides the lay
critics, a great many psychologists seem to think they have
outlived their usefulness. The view is commonly expressed that
they give an over-static picture of the developing child; they
merely measure end-products, rather than psychological
processes, and thus throw no light on how the retarded or
maladjusted child reached that state, and how he might grow out of it.
However, the available alternatives which are supposed to tell us
more about processes seem to consist mainly of Piagetian tasks,
ITPA, etc., whose shortcomings I mentioned earlier. Moreover the
well-trained Binet or Wechsler tester does find out quite a lot
about processes from his qualitative observations of the child's
ways of tackling items and the kinds of errors he makes. True,
these judgments are subjective, and I would entirely agree that
there is room for other supplementary tests to tell us more about
children's cognitive styles and strategies, and specific learning
disabilities, if some one would invent them. Those psychologists
who simply condemn all intelligence testing could do a
constructive and useful job by producing some better diagnostic tools.
I do not see any particular virtue in retaining the term
Intelligence as such, except that the general factor which runs through
a wide variety of cognitive skills is too large to be ignored, and
must be called something. And it does undoubtedly account for a
great deal of the variance in cleverness versus dullness of children,
either in everyday life, or at school. But it might be preferable to
use a series of factor tests, like the McCarthy scales (1972), which
yield Indices for Verbal, Perceptual-Performance, Quantitative,
Memory and Psychomotor abilities. The first three of these are
combined to give a General Cognitive Index. These are limited to
the 2½ to 8½ year range. The new British Ability Scale (Elliott,
1974) covers from 2½ to 17 years, but it includes over 20
separately standardised tests, from which the tester can select
those thought relevant to the particular child. The combined score
on four designated tests yields a conventional IQ. So far there
seems to be no evidence whether school or clinical psychologists
regard either of these modern tests as preferable instruments to
the Binet or WISC.

Several writers like Robert Glaser (1977) and Benjamin Bloom
(1976) take the view that IQ tests are predictive of achievement
only in a monolithic educational system. If we could provide more
adaptive or individualised programmes to suit each particular
child, the correlation would be much reduced, and we could make
better predictions by means of a series of criterion-referenced
tests, to show just what stage a child has reached in each subject,
and what he is ready to go on to next. Their work is impressive,
but it seems to me that its applicability is limited mainly to
arithmetic, perhaps natural sciences, and early stages in English. Also
I suspect that there will still be wide individual differences in
overall rate of progress, and that refusal to measure this general
factor would amount to throwing away the baby with the bath
water. While it is true that both intelligence and educational
achievement tests depend partly on genetic factors, partly on
home and school environment, it is not true that they both
measure the same thing. Many investigations have shown that the
heritability of measured intelligence is a good deal higher than
that of achievement. Intelligence refers to the general reasoning
and other cognitive capacities which are developed largely by
stimulation received in the home and in leisure hours or
peer-group activities; whereas achievement refers to the more
specialised performance in school subjects which depends greatly
on the quality of teaching and on children's motivation to learn.
Nevertheless the intelligence factor gives us useful educational
predictions in so far as children may usually be expected to be
able to apply the reasoning capacities built up outside school to
tackling any new topic in school.

This is very different from the belief that IQ tests measure
innate ability, and educational tests measure acquired knowledge,
a belief which is still far too commonly held by many teachers,
and even by some educational psychologists. But if we accept, as I
have been arguing, that genetical and environmental influences
are both very much involved in the development of human
intelligence, then it may be concluded that our tests, regardless of
whether they are called intelligence tests or something different,
will continue to be of immense value in diagnosing children's
strengths and weaknesses in all areas of Special Education.